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Abstract 



Sturmian sequences are well-known as the ones having minimal complexity over a 2-letter 
I alphabet. They are also the balanced sequences over a 2-letter alphabet and the sequences 

describing discrete lines. They are famous and have been extensively studied since the 18th 
century. One of the extensions of these sequences over a fc- letter alphabet, with fc > 3, are 
• the episturmian sequences, which generalizes a construction of Sturmian sequences using the 

I palindromic closure operation. There exists a finite version of the Sturmian sequences called 

the Christoffel words. They are known since the works of Christoffel and have interested 
many mathematicians. In this paper, we introduce a generalization of Christoffel words for 
an alphabet with 3 letters or more, using the episturmian morphisms. We call them the 
I epichristoffel words. We define this new class of finite words and show how some of the 

■ properties of the Christoffel words can be generalized naturally or not for this class. 

!> ■ 

1 Introduction 

CD \ As far as we know, Sturmian sequences first appeared in the literature at the 18th century in the 

precursory works of the astronomer Bernoulli [Ber72]. They later appeared in the 19th century 
in Christoffel [Chr75] and Markov [Mar82] works. The first deep study of these sequences is 
given in [MH38, MH40] where the name Sturmian sequence appears for the first time. At the 
^ ' end of the 20th century and more recently, many mathematicians have been interested in those 

sequences, for instance [CH73, Cov75, Sto76, Bro93, BPR94, Zic95, dL97a, Ber02]. Recent 
books also show this interest [Lot02, PF02, AS03, BLRS] as well as a recent survey [Ber07]. In 
this wide literature, we find different characterizations of the Sturmian sequences. In particular, 
they are the sequences over a 2-letter alphabet having the minimal complexity, they also are 
the balanced sequences over a 2-letter alphabet and they code discrete lines. These different 
characterizations show how the Sturmian sequences occur in different fields as number theory 
[Mor85, Sim91, TijOOb, TijOOa, BV03, Sim04, GO05], discrete geometry, crystallography [BT86] 
and symbohc dynamics [MH38, MH40, Hed44, Que87]. 

Since the end of the 20th century, numerous generalizations of Sturmian sequences have been 
introduced for an alphabet with more than 2 letters. Among them, one natural generalization 
is called the episturmian sequences and is using the palindromic closure property of Sturmian 
sequences [dL97b]. The first construction of episturmian sequences is due to [DJPOl]. Previ- 
ously the first introduction and study of an episturmian sequence was that of the Tribonacci 
word [Rau82] and an important class of episturmian sequences, now called the Arnoux-Rauzy 
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sequences, had been considered in [AR91, RZOO]. More recently the whole class was exten- 
sively studied, for instance in [JVOO, RZOO, DJPOl, JP02, JP04, Jus05, Gle07, PV07, Ric07b, 
Gle08, BdLLZOS, GJP08, GLR08]. For surveys about episturmian sequences, see for instance 
[Ber07, GJ09]. 

The finite version of Sturmian sequences, called Christoffel words, has been also well studied 
[Chr75, Lot02, BR06, BdLR07, KR07]. It is known that any finite standard Sturmian word, that 
is the words obtained by standard Sturmian morphisms to a letter, is conjugate to a Christoffel 
word. A Christoffel word is then the smallest word, with respect to the lexicographic order, 
in the conjugacy class of a finite standard Sturmian word. Finite factors of the episturmian 
sequences appeared for instance in [GJP08]. The class of standard episturmian words is natu- 
rally defined as the set of finite words obtained by standard episturmian morphisms to letter, 
but no generalization of the Christoffel words have been introduced yet. In this paper, we 
introduce such a generalization that we naturally call the epichristojfel words. Note that it 
naturally appears that for each standard episturmian word, there exists a conjugate which is 
an epichristoffel word, and conversely. 

The paper is organized as follows. 

We first recall some basic definitions of combinatorics on words and we establish the 
notation used in this paper. We recall the definitions and some properties of the Sturmian 
sequences, the Christoffel words and the episturmian sequences. Then we introduce our new 
class of finite words: the epichristojfel ones. We prove how some of the properties of the 
Christoffel words can be generalized for an alphabet with more than 2 letters. We then describe 
an algorithm which determines if a given fc-tuple describes the occurrence numbers of letters 
in an epichristoffel word or not. If so, we show how to construct it. Finally, we prove the next 
theorem, which is a generalization of a result for Christoffel words [dLD06], that characterizes 
epichristoffel conjugates. 

Theorem Let w be a finite primitive word different from a letter. Then the conjugates of w 
are all factors of the same episturmian sequence if and only ifw is conjugate to an epichristoffel 
word. 

2 Definitions and notation 

Throughout this paper, A denotes a finite alphabet containing k letters ao,ai, . . . ,ak-i. A 
finite word is an element of the free monoid A*. If w = w[0]w[l] ■ ■ ■ w[n — 1], with w[i] £ A, 
then w is said to be a finite word of length n and we write \w\ = n. By convention, the empty 
word is denoted e and its length is 0. We define A^ the set of right infinite words, also called 
sequences, over the alphabet A and then, A°° = A* U A^ is the set of finite and infinite words. 

The number of occurrences of the letter Oj in w is denoted \w\ai. The reversal of the word 
w = w[0]uj[l] ■ ■ ■ w[n — 1] is = w[n — l]w[n — 2] • • • w[0] and if w = w, then w is said to be a 
palindrome. A finite word / is a factor of i« € A°^ if w = pfs for some p G A*, s G A^. If p = e 
(resp. s = e), f is called a prefix (resp. a suffix) of w. Let u G A* and n G N. We denote by 
u" the word u repeated n times and we called it a n-th power word. A factor a'' of the word w, 
with a G A and fc G N locally maximum, is called a block of a of length k in w. Let u, v be two 
palindromes, then u is a central factor of f if u = www for some w G A*. The right palindromic 
closure of w G A* is the shortest palindrome u = w^~^^ having w as prefix. 

The set of factors of w £ A^ is denoted F{w) and Fn{w) = F{w) n is the set of all 
factors of w of length n G N. The complexity function is given by P{n) = \Fn{w)\ and is the 
number of distinct factors of w of length n G N. Two words w and w' are said equivalent if they 
have the same set of factors: F{w) = F{w'). 
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The conjugacy class [w] ofw& is the set of all words • • • l]ii;[0] • ■ ■ 1], 

for 0<i<n — 1. Ifwis not the power of a shorter word, then w is said to be primitive and has 
exactly n conjugates. If w is the smallest of its conjugacy class, relatively to some lexicographic 
order, then w is called a Lyndon word. 

Let w be an infinite word, then a factor / of w is right (resp. left) special in w if there exist 
a,b & A, a ^ b, such that fa, fb G Flw) (resp. af, bf G F^w)). A word w over A is balanced if 
for all factors u and v w having the same length, for all letters a A, one has 

\\u\a - \v\a\ < 1- 

If w = pus E A'^, with p,u ^ A* and s G A^, then p~^w denotes the word us. Similarly, 
ws~^ denotes the word pu. 

An integer p € N is a period of the word w = w[0]w[l] ■ ■ ■ w[n — 1] G A* if w[i] = w[i + p] 
for < i < n — p. When p = 0, the period is trivial. If p is the smallest non trivial period of 
w, then the fractionnary root of w is defined as the prefix Zy^ of w of length p. An infinite word 
w € A^ is periodic (resp. ultimately periodic) if it can be written as w; = n'^ (resp. w = vu^), 
for some u,v ^ A*. If w is not ultimately periodic, then it is aperiodic. A morphism f from 
A* to A* is a mapping from A* to A* such that for all words u,v e A*, f{uv) = f{u)f{v). A 
morphism extends naturally on infinite words. 

3 Sturmian, ChristofFel and episturmian words 

Before introducing our generalization of Christoffel words, inspired by the definition of epis- 
turmian sequences, let us recall the definition of these well-known families and some of their 
properties. 

3.1 Sturmian words and morphisms 

One of the classical definitions of Sturmian sequences is the one given by Morse and Hedlund 
[MH40]: 

Definition Let p, called the intercept, and a, called the slope, be two real numbers with a 
irrational such that < a < 1. For n > 0, let 

J a if \a{n -|- 1) -|- pj = [an + p\ , 
1^ b otherwise, 

J a if \a{n -|- 1) -|- p] = \an + p\ , 
b otherwise. 

Then the sequences 

= s[0]s[l]s[2] • • • and <p = s'[0]s'[l]s'[2] • • • 

are Sturmian and conversely, a Sturmian sequence can be written s' „ for a irrational 

and p e M. 

Sturmian sequences have several characterizations. For more details about this class of 
words, we refer the reader to the section in [Lot02] devoted to Sturmian sequences. 

Proposition [CH73] A sequence s is Sturmian if and only if for all n G N, P{n) = n + 1. 



s[n] = 
s'[n] = 
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Theorem 1 [Lot02] Let s be a sequence. The following assertions are equivalent: 

i) s is Sturmian; 

ii) s is balanced and aperiodic. 

Definition 2 [Lot02] A morphism f is Sturmian if f{s) is Sturmian for all Sturmian se- 
quences s. 

3.2 Christoffel words 

In discrete geometry, Christoffel words are defined as the discretization of a line having a 
rational slope, as introduced in [BL93]. In symbolic dynamics, they are defined by exchange of 
intervals [MII40] as follows. 

Definition Let p and q be positive relatively prime integers and n = p + q. Given an ordered 
2-letter alphabet {a < b}, the Christoffel word w of slope p/q over this alphabet is defined as 
w = w\fd\w\i] ■ ■ ■ w[n — 1], with 

r.-, { a \i ip mod n > (i — l)p mod n, 
\ b ii ip mod n < [i — l)p mod n, 

for < i < n — 1, where k mod n denotes the remainder of the Euclidean division of k by n. 

Notice that since p and q are relatively prime, a Christoffel word is always primitive. Other 
important properties of Christoffel words will be recalled just before their generalizations in 
Section 4. 

3.3 Episturmian sequences and morphisms 

One of the possible generalizations of Sturmian sequences for an alphabet with 3 letters or more 
is the set of episturmian sequences. Let us first recall the definition of standard episturmian 
sequences as introduced initially by Droubay, Justin and Pirillo. 

Definition 3 [DJPOl] A sequence s is standard episturmian if it satisfies one of the following 
equivalent conditions. 

i) For every prefix u of s, u^'^^ is also a prefix of s. 

ii) Every leftmost occurrence of a palindrome in s is a central factor of a palindromic prefix 
of s. 

iii) There exist a sequence uq = e,ui,U2,... of palindromes and a sequence A(s) = 
x[0]x[l] • • • , with x[i] G A, such that n„ defined by Un+i = {unx[n])^~^\ with n > 0, 
is a prefix of s. 

Definition 4 [DJPOl] A sequence t is episturmian if F{t) = F{s) for a standard episturmian 
sequence s. 

An equivalent definition is that a sequence s G A'^ is episturmian if its set of factors is closed 
under reversal and s has at most one right (or equivalently left) special factor for each length. 

Notation 5 [Jus05] Let w = w[0]w[l] ■ ■ ■ w[n — 1], with w[i] £ A, and uq = £,..-, Un = 
{un~iw[n — the palindromic prefixes of Then Fal{w) denotes the word m„. 
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In Definition 3, A(s) is called the directive sequence of the standard episturmian sequence s. 
Since A(s) is the limit of its prefixes and s is the limit of the n„, it is natural to write s = 
Pal(A(s)). 

Let us recall from [Jus05] a useful property of the operator Pal. 

Lemma 6 [Jus05] Let x £ A, w G A* . If\w\x = 0, then Fal{wx) = Fa\{w)xPal{w) . Otherwise, 
we write w = wixw2 with \w2\x = 0. The longest palindromic prefix ofVal{w) which is followed 
by X in 'Pal{w) is Pal(t(;i). Thus, Pal(u;a;) = Pal(if)Pal(i(;i)~^Pal(tt;). 

Definition 7 For a, 6 G we define the following endomorphisms of A*: 

i) V'a(a) = V'a(a) = a; 

ii) Tpa{x) = ax, if X € ^ \ {a}; 

iii) ipai^) — ''^^ X £ A \ {a}; 

iv) 9ah{a) = b , 9abib) = a, 6ab{x) = x, x e A\{a, b}. 

The endomorphisms and ^p can be naturally extended to a finite word w = 

w[0]w[l]---w[n^ 1]. Then = V^iup] (' • • (V'«;[n-i] (a)) ' ' ' )) and ip^{a) = 

V't«[o](V'«;[i](- • • (V'«,[n- !](«)) ■■■)), with aeA. 

Similarly to the Sturmian morphisms, we can define the episturmian morphisms as follows. 

Definition 8 [JP02] The set of episturmian morphisms is the monoid generated by the 
morphisms tpaii^ay^ab under composition. The set ^ of standard episturmian morphisms is the 
submonoid generated by the ipa and Bab', the set of pure episturmian morphisms is the submonoid 
generated by the V'a and ip^. 

As the Sturmian morphism, the episturmian ones have the following characteristic property: 
a morphism / is episturmian if /(s) is episturmian for any episturmian sequence s. 

4 Epichristoffel words 

In this section, we generalize Christoffel words to a fc-letter alphabet and we call this general- 
ization epichristoffel words. 

Let us first recall some properties of Christoffel words that will be used to define their 
generalization. 

Lemma 9 [BdL97] A word w is a Christoffel word if and only if w is a balanced Lyndon word. 

The next proposition follows from Seebold, Richomme, Kassel and Reutenauer works [See96, 
See98, RicOTa, KR07] and is proved in [Chu99]. 

Proposition 10 Christoffel words and their conjugates are exactly the words obtained by the 
application of Sturmian morphisms to a letter. 

Lemma 9 and Proposition 10 have for consequence the following corollary. 

Corollary 11 In the conjugation class of a Christoffel word, the Lyndon word is the Christoffel 
word. 

Note that Corollary 11 is the result we will extend as a definition of epichristoffel words. 
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Definition 12 A finite word w (z A* belongs to an epichristoffel class if it is tlie image of a 
letter by an episturmian morphism. 

Definition 13 A finite word w S A* is epichristoffel if it is the unique Lyndon word occurring 
in an epichristoffel class. 

In the sequel, a word in an epichristoffel class will be called c- epichristoffel, for short. The 
following result insures that the epichristoffel classes are well-defined. 

Proposition 14 Let w and w' be conjugate finite words. Then w = (j){u) and w' = (j)'(u'), with 
(j), 4>' € {tpa, V'a}; f^i" u, u' £ A* , a £ A if and only if u and u' are conjugate. 

Proof. 

Without loss of generality, we can suppose that cj) = cj)' = -0^, since tpaiw) = ail)g^{w)a~^ 
and so, ipaiw) is conjugate to tpai''^) ^-^y word w. Thus, we can write w = 
a^°v[0]a"^v[l]- ■ ■ a"''^v[k], with v[i] ^ a and > for < i < k. Since w = ipaiu), 
using injectivity of ipa, we have u = a'^°~^v[0]a^'^~^v[l] ■ ■ ■ a^''~^v[k]. Since w and w' are 
conjugate, we can write w' = a'^v[i]a"'*+'^v[i + !]••• a^'^^f [i — l]a^, with a + /? = and 
Q > 1. Thus, u' = a'^~^v[i]a"''-+'^~^v[i + !]••• d"''--'^~^v[i — l]a^ . Comparing u and u', we 
conclude that u is conjugate to u' . 

(<^^) If u and u' are conjugate, then there exist v, t such that u = vt and u' = tv. Applying 
respectively the morphisms (p over u and u' , we obtain (/>(m) = (j)(v)4>{t) and 

4>'{u') = 4>' {t)(t)' [y). li (j) = (f)' the result follows. Otherwise, let us suppose 4> = il^a and 
(f)' = ■0^. Then we conclude using the fact that tpa{u) = ail) ^{u)a~^ : 

Ipaiu) = a'ijj ^{v)a~^ aip{t)a~^ = aipa{v)'4'a{'t)C'~^ ■ 

u 

The finite factors of episturmian sequences, also called finite Arnoux-Rauzy words, have 
already been studied. In [JP02], the authors used a subclass of c-epichristoffel words without 
mentioning that it is a generalization of Christoffel words. In their paper, they denoted by hn, 
the standard episturmian words, that is the words obtained by the application of standard epis- 
turmian morphisms to a letter. The c-epichristoffel words are exactly the set of all conjugates 
of the standard episturmian words and the smallest one in the conjugacy class is epichristof- 
fel. Notice that they form a subclass of the Arnoux-Rauzy word, since they all are factor of 
episturmian sequences, but any factor of episturmian sequence is not necessarily obtained by 
an episturmian morphism to a letter. For instance, the word abacab aabac ababacabaabacaba ■ ■ ■ 
contains the finite Arnoux-Rauzy word aabac which is not c-epichristoffel. 

In [JP02], the authors proved the 2 following properties. 

Proposition 15 ([JP02], prop. 2.8, prop. 2.12) Every standard episturmian word is primitive 
and can be written as the product of 2 palindromic words. 

It is clear that any standard episturmian word is conjugate to an epichristoffel word. Propo- 
sition 14 can be used to show the converse. Consequently, Proposition 15 can be generalized 
for any c-epichristoffel word, using the following lemma. 

Lemma 16 ([DJPOl], Lemma 3) The word u £ A* is a palindrome if and only if ijja{u)a and 
a0„(n) are so, a £ A. 
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Proposition 17 Every c-epichristojfel word is primitive and can be written as the product of 
2 palindromic words. 

Proof. By induction over the number of morphisms. For a single morphism applied over a 
letter, we get w = ab, with a,b ^ A and a ^ b, which is the product of two palindromes. 
Let us suppose that for a c-epichristoffel word there exist palindromic words u, v such that 
w = uv. Let X = ipdw) = ipc{uv) = '0c(^^)V'c(i') (resp. x = ipci''^) = V'c('")V'c('^))5 c € A. 
Then x = iljc{u)cc~^'ipc{v) (resp. x = f.{u)c~^ cijj i,{v)) , where 'ipc{u)c, c'^ipdv) (resp. ipdu)c~^, 
cip^v)) are palindromic words by Lemma 16. ■ 

Let now show how some of the properties of Christoffel words can be generalized to 
epichristoffel words. 

Recall that for Christoffel words, we have: 

Theorem 18 [dLD06] Let w be a non empty finite word. The following conditions are equiva- 
lent: 

i) w is a factor of a Sturmian sequence; 

ii) the fractionnary root of w is conjugate to a Christoffel word. 

First, note that the equivalence in Theorem 18 cannot be generalized to epichristoffel words. 
Indeed, let us consider the episturmian sequence 

s = aabaacaabaacaabaabaa ■ caabaacaabaaa ■ ■ ■ 

Then w = caabaacaabaaa is a factor of s, but its fractionnary root = w is not c-epichristoffel, 
as we will see later in Example 27. 

On the other hand, the converse holds for episturmian sequences and epichristoffel words. 

Theorem 19 Let w be a non empty word such that its fractionnary root is c- epichristoffel. 
Then w is a factor of an episturmian sequence. 

Proof. Let w = z^, with k > 1 G Q, z^ the fractionnary root of w. Let us suppose that z^ 
is c-epichristoffel. Thus there exist x ^ A* and a G A such that (f>^^^(f>^^^ ■ ■ ■ (a) = z^, with 
0^*^ G {^x[i],?x-[i]}- Then u- is a factor of zj?^ = (</.(0) • • • (a)) T'^l = 0(0)0(1) • • • 0(")(ar*^T). 
It is sufficient to take an episturmian sequence having a^^^^ as a factor and apply the morphism 
0(0)0(1) . . . 0(n). obtain that 0(0) 0(i) • ■ ■ 0(")(ar'=l) is a factor of an episturmian sequence and 
so is w. m 

Proposition 20 Let w A* be a c-epichristoffel word. Then, the set of factors of length < \w\ 
of its conjugacy class is closed under mirror image. 

Proof. First note that the set of factors of length < \w\ of the epichristoffel class of w is the 
same as the one of w'^. Since any c-epichristoffel word w is the product of 2 palindromes (by 
Proposition 17), let w = piP2, with pi, p2 palindromes. Then w'^ = P1P2P1P2 and it follows that 
w = P1P2 = P2P1 is a factor of w'^. Thus, the mirror image of any factor of w is also a factor of 
w"^ and consequently, is in the epichristoffel class of w. m 

Remark 21 The right palindromic closure of a c-epichristoffel word is often a prefix of w'^, but 
it is not the case in general. It suffices to take the word w = abcbab for which w^~^^ = abcbab-cba. 
For Christoffel words, we have: 
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Lemma 22 [dLM94]74 Christoffel word can always he written as the product of two Christoffel 
words. 

But: 

Lemma 23 An epichristojjel word cannot always be written as the product of two epichristoffel 
words. 

Proof. It is sufficient to consider the epichristoffel word aabacab. The only decompositions in 
c-epichristoffel factors are a ■ abacab and aab ■ acab, but abacab and acab are not Lyndon words, 
assuming a < b < c. m 

Lemma 24 Any c- epichristoffel word having length > 1 can be non-uniquely written as the 
product of two c-epichristoffel words. 

Proof. For the non unicity, it is sufficient to consider the example of the word aabacab 
given in the proof of Lemma 23. By definition, any c-epichristoffel word can be written as 
(^(0)^(1) . . .,^("-i)(a), with a e A, £ {ipw[i\,i^w[il}^ w £ and w[n - 1] / a. Assume 
^(n-i) _ To prove the existence of the product, it is then sufficient to consider the 

words (/)(°)(/)(^) • • • 0("-i)(tt;[n - 1]) and • • • (a), since 

0(o)^(i)...^(n-i)(«) = </,(o)0(i)...</>("-2)(u;[n-l]a) 

= 0(0) 0(1) .. . 0(«-2) _ 1] ) . ^(0) ^(1) . . . _ 

The case 0("~i) = V^uifn-i] is analogue: we would have obtained a conjugate. 



5 Epichristoffel /c-tuples 

Recall from [BL93] that for a given {p,q), with p, g € N, there exists a Christoffel word with 
occurrence numbers of letters p and q if and only if p and q are relatively primes. Moreover, it 
is possible to construct the corresponding Christoffel word, using a Cayley graph (see [BR06]). 

In this section, we give an algorithm which determines if there exists or not an epichristoffel 
word w over the alphabet A = {oq, ai, . . . , Ok-i} such that p = {po,Pi, ■ ■ ■ ,Pk-i) withpj = \w\ai, 
for < i < A; — 1. If so, we also give an algorithm that constructs it. 

Definition 25 Let p = {pQ,pi, . . . ,pk-i) be a /c-tuple of non negative integers. Then the 
operator T iN^ ^ l)^ is defined over the fc-tuple p as 

/ .-1 \ 

^(p) =T{pQ,p\,...,Pk-\) = (po,Pi, • • • \vi- XI ^Vi+u---.Vk-\). 

\ j=o,j^i J 

where pi > pj, Vj ^ i. 

Proposition 26 Letp be a k-tuple. There exists an epichristoffel word with occurrence numbers 
of letters p if and only if iterating T over p yields a k-tuple p' with p'- = for f ^ m and p'^ = 1, 
for a unique m such that < m < k — 1. 

The idea of using the operator T comes from the algorithm computing the greatest common 
divisor of 3 integers as described in [CMR99] and of the tuples described in [JusOO]. 



8 



Example 27 There is no epichristoffel word with the occurrence numbers of letters (2, 2, 9). In- 
deed, T(2, 2, 9) = (2, 2, 5), T\2, 2, 9) = T{2, 2, 5) = (2, 2, 1), T^{2, 2, 9) = T{2, 2, 1) = (2, -1, 1). 
On the other hand, the 6-tuple q = (1, 1, 2, 4, 8, 16) does so: 

r(l,l,2,4,8,16) = (1,1,2,4,8,0) 

r2(g) = T(l,l,2,4,8,0) =(1,1,2,4,0,0) 

T\q) = r(l, 1,2, 4, 0,0) =(1,1,2,0,0,0) 

T^{q) = T(l, 1, 2, 0, 0, 0) = (1,1,0,0,0,0) 

T\q) = r(l, 1,0, 0,0,0) =(1,0,0,0,0,0). 

Some lemmas are required in order to prove Proposition 26. 

Lemma 28 Let w = 4>{u), with (p G {^ao; "^aoi' ~ i^o, «!)•••, CLk-i} and u £ A* . Then 
k-l 

E 



k-l 



i=0 

ii) \w\an = \u\a,, + ^ \W\ 



O- i ■ 



Proof. The first equality comes from the definition of ipao and il^ao- each letter a ^ oq, 
''Paaioi) = a^a, ip^^ = aao and V'ao ~ V'ao(o^o) = ^0^ ^ adds as much oq as the occurrence 
numbers of the other letters in the word u. The second equality follows from the first one, since 
\w\ai = \u\ai for i^O. m 

Lemma 29 Let w G A* be a c- epichristoffel word. Then, there exist a c- epichristoffel word 
u G A* , \u\ > 1 and an episturmian morphism (p € {"^ao; V'aoi' ^^^^ "0 ^ A, such that w = (j){u) 
if and only if \w\aQ > \w\ai for all ai £ A, i ^ 



Proof. 

{==^) By contradiction. Let us suppose there exists u with \u\ > 1 such that w = <j){u) and \w\ao 
is not maximum. Then, there exists at least one letter ai £ A such that \w\a. > l^i'lao- 
Without loss of generality, let us suppose that i = 1. By Lemma 28, \w\ao = Xlto^ I^Iq^ — 
Wlao + I^^Ui + Xyi=2 I'^lo.i that implies jii'lao ~ I'^^Ui ~ I'^^Uo 

+ I]i=2 \u\ai < 0, which is 

possible only if \u\a^ = for all z 7^ 1 and then |w|ai = I^Uo- Hence, we would have 
that u = ai" and w = (/)(ai"). The only possibility is that n = 1, since a c-epichristoffel 
word is primitive. Then \u\ = 1: contradiction. Hence, if u; = (j){u), with \u\ > 1, \w\ao is 
maximum. 

(<^) Let us now suppose that |w|ao > \w\ai for all Oi £ A, i ^ 0. Since w is c-epichristoffel, 
there exist an episturmian morphism <j) G {ipai^i^ai} and a c-epichristoffel word u £ A* 
such that (/>(ti) = w. Let us suppose that i ^ 0. Using Lemma 28, \w\a^ = \w\ao + \u\ai + 

Si<i<fc-ij¥il"'l%- ^^^"^ '"^'"o > Hai, it implies that \u\a^ + J2i<j<k-i,j^i\Ma, < 0, 
which is impossible. Thus i = 0. m 



An interesting consequence of Lemma 29 is the following. 
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Proposition 30 Let u and v be c-epichristojfel words. If \u\a = \v\a for all a A, then u and 
V are conjugate. In other words, a k-tuple of occurrence numbers of letters determines at most 
one epichristoffel conjugacy class. 

Proof. By induction. The result is true when \u\ = \v\ < 2. Assume by now that \u\ > 3. By 
definition of epichristoffel words, there exist letters a and b, and epichristoffel words u',v' such 
that u = 4>iu'), V = (l)'{v'),(j) £ {V'ajV'a} 'P' £ {V'bjV'fe}- Fiom \u\ > 3 and definitions of 
morphisms tpa, ipa^ i^b, V'b) we get \u'\ > 2, |?;'| > 2. Prom Lemma 29 and the fact that \u\a = \v\a 
for all letters a, it comes that a = b (and \u\a = \v\a > \u\a = \v\a for all letters a). Now from 
definition of u' and v' and properties of u and v, we deduce that \u'\a = \v'\a for all letters a. 
By inductive hypothesis, u' and v' are conjugate. Proposition 14 allows to conclude. ■ 

The algorithm induced by the iteration of Lemma 29 leads to a construction of words which 
are images of a letter by an episturmian morphism, that is c-epichristoffel words. Indeed, iterat- 
ing T gives a construction of an c-epichristoffel word with p describing the occurrence numbers 
of letters. We take p as the initial fe-tuple. The iteration over p of the operator T described 
previously yields a finite sequence of fc-tuples We do as in Proposition 26, 

applying the operator T and moreover, we keep an important information that allows us to 
construct the word: the letter with maximal number of occurrences. Let 



denote the relation T{p^^^) = p^^'^^\ where p^ is the maximal integer of p^^\ 

(r) (r) 

Then, performing T until p- = for all i except for one V-i for which p- = 1, we get 
the sequence of fc-tuples 

p(0) % pil) % pii) % ... ilzl^ pir^l) hzl^ 

Then, 

is a c-epichristoffel word having p as occurrence numbers of letters, with a the letter such 
that p\ =1. The epichristoffel word is the Lyndon word of the conjugacy class of the word 
obtained. Here, Proposition 30 insures that it is sufficient to consider the standard episturmian 
morphism in order to construct a c-epichristoffel word with p describing the occurrences of the 
letters. 

Proof of Proposition 26. Follows directly from Lemmas 28, 29 and from the ideas described in 
the previous paragraph. The only difficulty concerns the last iteration, that is when w = 4>{u), 
with \w\aQ not maximum. As seen in the previous proof, it implies that u = ai and w = (f>{ai) £ 
{aoai,oiao}, which is clearly a c-epichristoffel word. Notice here that ipao{o-i) = V'ai('^o) and 
i'aoiO'i) = V'ai(ao) are conjugate. ■ 

Example 31 For the triplet (5, 10, 16) describing the occurrence numbers of respectively the 
letters a, b and c, the sequence obtained is 

(5, 10, 16) ^ (5, 10, 1) i (5, 4, 1) ^ (0, 4, 1) ^ (0, 3, 1) ^ (0, 2, 1) ^ (0, 1, 1) ^ (0, 0, 1). 
Performing the algorithm, we find the word 
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tpcbabbbb{c) = tpcbabbbibc) 

= i'cbabbiAibc)) 

= -fpcbabitpbibbc)) 

= i'cbaiAibbbc)) 

= ipcbitpaibbbbc)) 

= ilJcii^bio-bababahac)) 

= jpc{babbabbabbabbabc) 

= cbcacbcbcacbcbcacbcbcacbcbcacbc. 

Since it is obtained by a standard episturmian morphism to a letter, this standard episturmian 
word is a representant of the epichristoffel conjugacy class. Moreover, its conjugate which is a 
Lyndon word, and so, an epichristoffel word, is acbcbcacbcbcacbcbcacbcbcacbc ■ cbc for the order 
a < b < c. 

Note that in the previous example, the choice of the last transition is arbitrary: we could 

c b 

have chosen the transition (0, 1, 1) — > (0, 1, 0) instead of (0, 1, 1) — » (0, 0, 1) and we would have 
obtained a conjugate of ipcbabbbbic) which is also c-epichristoffel. 

6 Criteria to be in an epichristoffel class 

Let us recall a characterization of words in the conjugacy class of a Christoffel word. 

Theorem 32 [dLD06] Let w € A* be a primitive word. Every conjugate w' is a factor of a 
Sturmian sequence, not necessarily the same, if and only if w is conjugate to a Christoffel word. 

The goal of this section is to prove the following generalization of Theorem 32. 

Theorem 33 Let w be a finite primitive word different from a letter. Then there exists an 
episturmian sequence z such that all the conjugates of w are factors of z if and only if w is a 
c-epichristoffel word. 

Note that in order to generalize Theorem 32 to a /c-letter alphabet, fc > 3, an additional 
condition is necessary: the conjugates must be factor of the same episturmian sequence. For 
example, every conjugates of the word abc are factors of episturmian sequences, but abc is not 
a c-epichristoffel word, since T(l, 1, 1) = (1, 1, —1). 

Let us recall the following results of Justin and Pirillo that allow us to write any episturmian 
sequence as the image by an episturmian morphism of an other episturmian sequence. 

Corollary 34 [JP02] Let s G and A = 2:[0]x[l]x[2] • • • , x[i] G A. Then s is a standard 
episturmian sequence with directive sequence A if and only if it exists an infinite sequence of 
sequences s^^^ = s,s^^\.s^'^\ . . . such that for any z G N, = ipx[i\{s^'^'') ■ 

It can also be generalized to non standard episturmian sequences. In order to do so, let us 
recall what is a spinned word. Let A = {a\a G A}. A letter x is considered as x with spin 1 
while x itself is considered as x with spin 0. Then, an infinite spinned word s = s[0],s[l]s[2] • • • 
is an element of {A U A)^ . 

Theorem 35 [JP02] A sequence t G A^ is episturmian if and only if there exist a spinned 
sequence A = x[0]x[l]5;[2] • • • , x[i] G {A U A} and an infinite sequence of recurrent sequences 
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t(0) = t, . . . such that for i G N, t^*-^) = if has spin (resp. (*^*'') if 

x\i\ has spin 1). Moreover t is equivalent to the standard episturmian sequence with directive 
sequence A = x[0]a;[l] • • • . 

Theorem 35 allows us to write the directive sequence of a non standard episturmian sequence, 
as we do in the following lemma. 

Lemma 36 Let A(s) = {d)^bz be the directive sequence of an episturmian sequence s, with 
a^h ^ A and z E . Then the blocks of c^ a have length 1 and the blocks of a's have length 
£, k or {k + 1), where £ < k + 1 is the length of the block of a 's prefix of the sequence. 

Proof. Let us consider the equivalent standard episturmian sequence t directed by A(t) = a^bz. 
By Corollary 34, t = ■ip^ki,{t') for a standard episturmian word t'. Since il^g^kf^{a) = a^ba, 
ipa''bi^) — '^^^ ™d for c ^ {a, 6}, il^a'^hi^) — of^ba^c, the statement is true for t. Since the langage 
of s and t are equals, it only remains to consider the prefix of s where a block of length < k can 
appear. Indeed, for the episturmian sequence s, since it is directed by A(s) = {a)^bz^ we easily 
deduce that s begins by a prefix of a's of length I equals to the number of a having spin in 
the prefix {a)^ of its directive sequence, which is less or equal to k. ■ 

Remark 37 An episturmian sequence may not have blocks of a's of length (A; + 1). It is the 
case if its directive sequence has the form af^z, with \z\a = 0. 
One can be easily convinced of the following statement. 

Lemma 38 In an episturmian sequence w = il^ait) or w = ipa{t), any letter different from a is 
preceded and followed by the letter a, except for the first letter of the sequence, if it is different 
from a. 

Lemma 39 Let z = tpao (0 be a standard episturmian sequence and w = a^yai a factor of z, 
with oq ^ ai ^ A and y ^ A* . Then, there exists a factor u of t such that ipaoiu) = w. 

Proof, li z = Tpaoi't)^ i = ••• and Card(^) = k, then by the definition of ip, z = 

'V^ao(*[0])V'ao(^[l]) ■ ■ ■ ^ {oo, aooi, aoa2, . . . , aoafc_i}'^. Since w starts with and ends by ai, 
then any factor w of z = ipaoit) can be written as w £ {ao, aoai, 0002, . . . , aoa^.i}*. Thus we 
can construct a word u by associating to aoaj the letter Oi for i 7^ and to ao the letter ao. 
Thus, w is the image of the word u by the morphism ipao ■ ■ 

Proposition 40 Let z = ipa{t), where t and z are standard episturmian sequences. Let w be a 
factor of z not power of a letter, such that \w\ > 1 and all its conjugates are also factors of z. 
Then, there exists a factor u of t such that w = ipa{u) or w = ipg^(u). 

Proof. Let (z A, with 7 7^ a, y € A* and w factor of z. There are 4 cases to consider. 

i) vu = (iy^: its conjugate y^[5 is not a factor of z, since any occurrence of the letter [j is 
preceded by the letter a, by Lemma 38. Then w does not satisfied the hypothesis. 

ii) w = ay (3: by Lemma 39, there exists u factor of t such that ipa{u) = w. 

iii) w = (3ya: symmetric to the case ii). \i w = (3ya is a factor oi z = il^a{t) and satisfies the 
hypothesis, then there exists u factor of t such that ipa{u) = w. 
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iv) w = aya: rewrite w = a^y'a^, with m, n >1 and m, n maximum. The factor y' is not 
empty, since w is supposed not to be a power of a letter. Let us suppose that there exists 
(3 ^ A, j3 ^ a such that w(3 = a'^y'a^l3 is a factor of z. Since by Lemma 36 any block of 
a has length k or {k + 1), for some A; G N \ {0}, we have that n = koT:n = k + \. On 
the other hand, by the hypothesis, the conjugate y'a^'^'^ ol w = a^y'a^ is also a factor 
of z. Thus m + n < A; + 1. But since m ^ 0, the only possibility is that n = k and m = 1. 
Consequently w = ay'a^ . Its conjugate y'a^~^^ is also a factor of z and since y' does not 
start by a by the maximality of m, it should be preceded by a: ay'al''^^ = ay'af^a = wa 
is a factor of z. Since z is episturmian, wa factor of z implies that there exist ^ € N and 
j3 ^ a G A such that wa^j3 is so. By Lemma 39, there exists a word u' = ua^~^(3 such 
that ipaiu') = wa^(5. Since ipai^^'^P) = cl^P, w = ipaiu). 



We can now prove our main Theorem. 



Proof of Theorem 33. 

{=^) i) Let us suppose that all conjugates of w are factor of a standard episturmian sequence 
z = ipait)- We proceed by induction on the number of morphisms. Since z = ipa{t), 
by Proposition 40, there exists u such that w = tpaiu) or w = ipai''^)- Let us now 
prove that all conjugates u' of u are also factors of t. Since n, u' are conjugate, using 
Proposition 14, we have tpa{u') is a conjugate of tpaiu). Hence, again by Proposition 
40, there exists a factor u" of t with tpa{u') = ipaiu") or V'a(ii') = V'aC^")- The 
second case is possible only if u" is a power of a and then the first case holds. This 
first case by injectivity of ip implies u' = u" , that is u" is a factor of t. We then 
find a sequence of episturmian morphisms (/)o, .., G {ipa, i^a\^ ^ A}'^'^^ and a 
sequence of words w,wi,W2,---,Wk such that \w\ > \wi\ > \w2\ > ... > \wk\ = 1, 
w = (/)o((/)i(...((^fc(tt'fc))---)) and Wi = 4>i{(pi+i{. . . {(pkiwk))))- Thus, w is the image of 
a letter by an episturmian morphism, implying that w is c-epichristoffel. 

ii) If z is not standard, by Definition 4, we know that there exists an episturmian 
sequence z' such that F{z) = F{z'). Thus, we can then consider the sequence z' and 
conclude as in i). 

(<^^) Since w is c-epichristoffel, we can write w = f{a), where f £ S" and a A. Let s be 
an episturmian sequence having the factor aa and let consider the episturmian sequence 
f{s). Thus, it contains the factor ww and we conclude. 



7 Concluding remarks 

In this paper, we have most of the time consider the c-epichristoffel words, also known as 
the conjugates of the finite standard episturmian words. Some of the properties of standard 
Sturmian words can be generalized naturally to the c-epichristoffel ones. We unfortunately 
didn't find a characterization of the epichristoffel word of each conjugacy class. Geometrical 
properties of Christoffel words are well known and very interesting. It would be nice to know if 
there is a similar geometrical interpretation for the epichristoffel words. In this paper, we only 
verify if a few properties of the Christoffel words could be generalized or not to the epichristoffel 
ones. Since the literature of Christoffel words is wide, there are still a lot of open problems about 
epichristoffel words. For instance: do they satisfy a kind of balanced property? for a fixed A: > 3, 
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does there exist an epichristoffel word over a /c-letter alphabet of any given length? is it possible 
to give a closed formula for the number of epichristoffel words of a given length? Episturmian 
morphisms have been extensively studied for instance in [JusOl, JP02, RicOSa, RicOSb, JP04, 
Jus05, RicOTb]. It might be useful to use their properties to work on the epichristoffel words. 

Epichristoffel words are still more interesting since they seem to be related to the Fraenkel 
conjecture. This conjecture states that for a finite /c-letter alphabet, there exists a unique 
infinite word, up to letter permutation and conjugation, that is balanced and has pair-wise 
distinct letter frequencies. This unique word, if it exists, is conjectured to be periodic and can 
be written as p"^ , with p an epichristoffel word. Then, knowing more about epichristoffel words 
might help to prove the Fraenkel conjecture. 
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